444 research outputs found
Optimizing the MapReduce Framework on Intel Xeon Phi Coprocessor
With the ease-of-programming, flexibility and yet efficiency, MapReduce has
become one of the most popular frameworks for building big-data applications.
MapReduce was originally designed for distributed-computing, and has been
extended to various architectures, e,g, multi-core CPUs, GPUs and FPGAs. In
this work, we focus on optimizing the MapReduce framework on Xeon Phi, which is
the latest product released by Intel based on the Many Integrated Core
Architecture. To the best of our knowledge, this is the first work to optimize
the MapReduce framework on the Xeon Phi.
In our work, we utilize advanced features of the Xeon Phi to achieve high
performance. In order to take advantage of the SIMD vector processing units, we
propose a vectorization friendly technique for the map phase to assist the
auto-vectorization as well as develop SIMD hash computation algorithms.
Furthermore, we utilize MIMD hyper-threading to pipeline the map and reduce to
improve the resource utilization. We also eliminate multiple local arrays but
use low cost atomic operations on the global array for some applications, which
can improve the thread scalability and data locality due to the coherent L2
caches. Finally, for a given application, our framework can either
automatically detect suitable techniques to apply or provide guideline for
users at compilation time. We conduct comprehensive experiments to benchmark
the Xeon Phi and compare our optimized MapReduce framework with a
state-of-the-art multi-core based MapReduce framework (Phoenix++). By
evaluating six real-world applications, the experimental results show that our
optimized framework is 1.2X to 38X faster than Phoenix++ for various
applications on the Xeon Phi
PCAE: A Framework of Plug-in Conditional Auto-Encoder for Controllable Text Generation
Controllable text generation has taken a gigantic step forward these days.
Yet existing methods are either constrained in a one-off pattern or not
efficient enough for receiving multiple conditions at every generation stage.
We propose a model-agnostic framework Plug-in Conditional Auto-Encoder for
Controllable Text Generation (PCAE) towards flexible and semi-supervised text
generation. Our framework is "plug-and-play" with partial parameters to be
fine-tuned in the pre-trained model (less than a half). Crucial to the success
of PCAE is the proposed broadcasting label fusion network for navigating the
global latent code to a specified local and confined space. Visualization of
the local latent prior well confirms the primary devotion in hidden space of
the proposed model. Moreover, extensive experiments across five related
generation tasks (from 2 conditions up to 10 conditions) on both RNN- based and
pre-trained BART [26] based auto-encoders reveal the high capability of PCAE,
which enables generation that is highly manipulable, syntactically diverse and
time-saving with minimum labeled samples. We will release our code at
https://github.com/ImKeTT/pcae.Comment: Knowledge-Based System
- …